Adaptive Bug Classification for CVE List using Bayesian Probabilistic Approach
نویسندگان
چکیده
Software bug classification is a precondition for bug fixation and it plays a vital role in software maintenance. It is found that bug fixation often takes long due to the distribution of misclassified or non-classified bugs by the triager among the developers. In this paper, we propose an adaptive bug classification approach on CVE dataset that involves two Bayesian classifiers such as Naive Bayes and Bayes net, and takes adaptive decision for classification. Naive Bayes is a classification algorithm which adopts a naive approach regarding class conditional distribution during classification and assumes that all the features of a sample are conditionally independent given the class label. Bayes net is a graphical representation of a set of random variables and their conditional dependencies via a directed acyclic graph. It can be used for knowledge representation as well as classification. Exploiting the domain knowledge about the bug class, we conduct the experiments from two different view points group-based approach and general approach. In case of groupbased approach, both classifiers are learned using the bug groupspecific samples and selected features from five groups. In case of general approach, 28,266 bug samples and 64 bug features are considered. Experiments show that Bayes net classifier has more potential than Naive Bayes for classification with CVE dataset and therefore it is preferable. However, it is also found that Naive Bayes classifier performs fairly well in CVE bug classification due to its simplistic concept and less number of parameters.
منابع مشابه
A Probabilistic Bayesian Classifier Approach for Breast Cancer Diagnosis and Prognosis
Basically, medical diagnosis problems are the most effective component of treatment policies. Recently, significant advances have been formed in medical diagnosis fields using data mining techniques. Data mining or Knowledge Discovery is searching large databases to discover patterns and evaluate the probability of next occurrences. In this paper, Bayesian Classifier is used as a Non-linear dat...
متن کاملA Probabilistic Bayesian Classifier Approach for Breast Cancer Diagnosis and Prognosis
Basically, medical diagnosis problems are the most effective component of treatment policies. Recently, significant advances have been formed in medical diagnosis fields using data mining techniques. Data mining or Knowledge Discovery is searching large databases to discover patterns and evaluate the probability of next occurrences. In this paper, Bayesian Classifier is used as a Non-linear dat...
متن کاملA Probabilistic Model for COPD Diagnosis and Phenotyping Using Bayesian Networks
Introduction: This research was meant to provide a model for COPD diagnosis and to classify the cases into phenotypes; General COPD, Chronic bronchitis, Emphysema, and the Asthmatic COPD using a Bayesian Network (BN). Methods: The model was constructed through developing the Bayesian Network structure and instantiating the parameters for each of the variables. In order to validate the achiev...
متن کاملBug Classification: Feature Extraction and Comparison of Event Model using Naïve Bayes Approach
In software industries, individuals at different levels from customer to an engineer apply diverse mechanisms to detect to which class a particular bug should be allocated. Sometimes while a simple search in Internet might help, in many other cases a lot of effort is spent in analyzing the bug report to classify the bug. So there is a great need of a structured mining algorithm where given a cr...
متن کاملUsing Fuzzy LR Numbers in Bayesian Text Classifier for Classifying Persian Text Documents
Text Classification is an important research field in information retrieval and text mining. The main task in text classification is to assign text documents in predefined categories based on documents’ contents and labeled-training samples. Since word detection is a difficult and time consuming task in Persian language, Bayesian text classifier is an appropriate approach to deal with different...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2013